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(57) Abstract 

DNA constructs comprise a first 
exon sequence of rmdeotldes encoding a 
first peptide or polypeptide, a second exon 
sequence of nucleotides encoding a sec- 
ond peptide or polypeptide and a third 
sequence of nucleotides between the first 
and second sequences encoding a heterol- 
ogous intron, f^ example that of Tetrahy- 
mena thermophila nudear pre-iRNA, be- 
tween RNA splice sites and a site^^eciftc 
recombination sequence, such as loo^^ 
within the intron, the exons together en- 
coding a product peptide or polypeptide. 
Such ccmstructs are of use b methods 
of production of peptides or polypeptides, 
transcrq>tion leading to splicing out of 
the faitron enabling translation of a single 
Cham product peptide or polypeptide. Iso- 
lated nudeic acid constructs consisting es- 
sentially of a sequence of rmdeotldes en- 
coding a self-splidng intron with a site- 
specific recombination sequence withm 
the intron, for use in creation of constructs 
for expression of peptides or polypeptides, 
are also provided. 
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KBCGMBXmNT BINDING PROTEINS AND PEPTIDES 
The present invention relates to polypeptides 
which comprise two or more con5)onent polypeptides or 
peptides, methods for meJcing them and DNA constructs 
5 for the use in this making. In particular, it relates 
to the provision of repertoires of such polypeptides 
and encoding nucleic acid therefor. 

In this application, we describe the generation 
of binding proteins and peptides using nucleic acid 

10 containing introns with RNA splice sites such as self- 
splicing introns, preferably in conjunction with a 
site-specific recombination system, such as lox P 
(Hoess et al Proc. Natl. Acad. Sci. USA 79 3398-3402, 
1982; Stembeirg et al J. Mol. Biol. 150 467-486, 1981). 

15 The site-specific recombination allows two sequences of 
nucleic acid to be cloned separately as libraries and 
be brought together subsequently by a recombination 
event (Waterhouee et al Nucleic Acids Res. 21 2265- 
2266, 1993; A.D. Griffiths et al. EMBO J. in press; WO 

20 92/20791; WO 93/19172. One library of sequence is 

cloned into a first replicon and a second library of 
sequences into a second replicon. Recombination 
between the sites brings together libraries of both 
sequences on the same replicon. This recombination can 

25 be performed in vivo e.g. by PI infection or by using a 
recombinase encoded by a plasmid in E.coli or in vitro 
using soluble recombinase. For lox P, the recombinase 
is Ore. This allows a large library to be made where 
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the limitation is not the cloning efficiency but rather 
the number of cells which can be grown. Thus the 
method is particularly powerful in combination with 
phage display technology which allows the selection of 
5 proteins with desired binding properties from a large 
library of displayed proteins (WO 92/01047; WO 
92/20791; WO 93/06213; WO 93/11236; WO 93/19172; 
PCT/GB93/02492) . The size of the library is 
significant for ability to select antibodies or other 
10 binding proteins of appropriate affinity and 
specificity. 

WO 93/19172 describes recombining two libraries 
of nucleic acid using a site-specific e.g. lox P, 
system mainly to code for heterodimeric proteins in 

15 which two chains encoded by distinct (separate) nucleic 
acid sequences associate to form a functional binding 
site. Also described is the bringing together of two 
polypeptides for continuous open reading frames. 
However, this imposes the use of an amino acid sequence 

20 encoded in the site-specif ic recombination sec[uence at 
the jiinction between the two parts of the sequence, for 
instance the linker in single chain FV molecules. A 
problem with this is that there is only one open 
reading frame in the lox P sequence and the amino acids 

25 encoded by this may be incompatible with the expression 
of many proteins in functional form. If alternative 
lox P sites to the wild-type are used (eg see Figure 
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4), further different amino acid sequences may be 
generated, but the possibilities are still restricted. 

For instance, functional single chain Fv 
molecules can be constructed with 15 amino acid linkers 
5 encoded in part by the loxP recombination site. The 
length of the loxP site {34bp) however means that a 
minimum of 11 heterologous ("foreign") amino acids must 
be incorporated into the final expressed protein. This 
makes the incorporation of a loxP site into a 

10 continuous reading frame unsuitable for the 

construction of a diabody repertoire and also leaves 
little scope for the modification of scFv linkers to 
enhance expression. 

The present invention involves RNA splicing, 

15 particularly the use of self -splicing introns. This 

allows the recombination site to be inserted within the 
intron so that amino acids encoded by nucleotides which 
are spliced out are not incorporated into the final 
expressed protein. In such circumstances, the only 

20 "foreign" amino -acids which need be incorporated are 
those derived from the sequences at either end of the 
self -splicing intron. (Note: the amino acid 
composition and sequence of the product can be 
engineered with precision and amino acids inserted, 

25 substituted or deleted according to choice and using 
techniques known in the art . ) 

When a self -splicing intron is used, the amino 
acids that are incorporated derive from the PI sequence 
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at the 5' splice site (5'SS) and the PlO sequence at 
the 3' splice site (3'SS). These pair with the 
internal guiding sequence of the intron to form hairpin 
loops (Figure 1) and splicing then occurs as indicated. 

The use of self -splicing introns allows the use 
of recombination by lox P to be extended to 
construction of large libraries of contiguous 
polypeptide chains where the two parts of the chain 
separated by the intron are varied. 

In the application EP 93303614.7, priority from 
which is claimed by PCT/GB93/02492, an example is given 
of use of a loxP site inserted within a self-splicing 
intron with a bivalent or bispecific "diabody" . A 
"diabody" is a multivalent or multispecif ic multimer 
15 (e.g. bivalent or bispecific dimer) of polypeptides 

wherein each polypeptide in the raultimers coii^rises a 
first domain comprising a binding portion of an 
immunoglobulin heavy chain variable region linked to a 
second domain which comprises a binding protein of an 
20 immunoglobulin light chain variable region such that 

the domain of a given polypeptide cannot associate with 
each other to form an antigen binding site. Antigen 
binding sites are formed from an antigen binding site. 
Antigen binding sites are formed by multimerisation 
25 (e.g. dimerisationj of the polypeptides. 

The expression of bivalent diabodies from DNA 
containing a self -splicing intron is shown in Figures 1 
and 2. Application EP 93303614.7 also shows the use of 
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this system for chain- shuffling. (See also Figure 3.) 
PCT/GB93/02492 describes splicing out a lox P site 
using a self-splicing intron for a bispecific diabody 
(Example l of this application) . m these two earlier 
i applications the use of self -splicing introns was 

described for splicing only between the two domains of 
diabodies. The use of self -splicing introns to bring 
together two portions of polypeptide chain however has 
general applicability and can equally well be applied 
to single chain Pv fragments, peptide libraries or 
indeed any polypeptide sequence. 

The use of systems such as lox P which promote 
recombination allows one polypeptide sequence to be 
replaced by another one with a similar or different 
function, originally encoded on another replicon. This 
is particularly useful with polypeptide chains such as 
single chain Fvs which have two or more domains which 
contribute to function. The invention allows the use 
of two repertoires of nucleic acid, with a splice site 
between the two repertoires and proteins or peptides 
thus encoded selected, in one embodiment, termed 
"chain shuffling", one nucleic acid sequence is kept 
constant and the library of other chains recorabined at 
the lox P site in the intron. 
25 Self -splicing introns have been shown to be 

functional in B. coli using a system, in which the 
Tetrahymeaa intervening sequence {a group I self- 
splicing intron) was inserted into the gene encoding 
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the Qf-peptide of S-galactosidase (J.V. Price & T.R. 
Cech Science 228 719-722, 1985; R.B. Waring et al Cell 
40 371-380, 1985; M.D. Been & T.R. Cech Cell 47 207- 
216, 1986) . The presence of blue colonies indicated 
5 that self -slicing was functional in E.coli., because 
the a-peptide complemented the B-galactosidase enzyme 
acceptor. This system has been used in diagnosis of 
the intron sequences which are compatible with self- 
splicing. 

10 Although self -splicing introns have been inserted 

into fxinctional proteins as above splicing introns have 
not been used for protein engineering strategies or for ~ 
processes which involve the recombination of two 
repertories of nucleic acid. 

15 The present invention provides a DNA construct 

comprising a first sequence of nucleotides encoding a 
first peptide or polypeptide, a second sequence of 
nucleotides encoding a second peptide or polypeptide 
and a third sequence of nucleotides between the first 

20 and second sequences encoding a heterologous intron 
between RNA splice sites amd a site-specific 
recombination sequence within the intron. The presence 
and position of the RNA splice sites render the intron 
operable for splicing out of nucleotides from between 

25 the first and second sequences upon transcription of 
the DNA construct into RNA, which may result in 
splicing together of the first and second sequences. 
Depending on the intron used, one or more nucleotides 



may remain between the first and second sequences in 
transcribed RNA following splicing, resulting in one or 
more amino acids between the first and second peptides 
or polypeptides in the product of translation of the 
RNA. However, those skilled in the art will recognise 
that the first and second sequences may be termed 
" exon " sequences . 

The terra "heterologous" {or "foreign") indicates 
that the intron is one not found naturally between the 
first and second sequences in a position operable for 
removal of nucleotides from between the first and 
second sequences upon transcription. DNA constructs 
according to the present invention are "artificial" in 
the sense that they do not occur naturally, ie without 
human intervention by means of recombinant DNA 
technology. 

The first and second peptides or polypeptides may 
be any sequence of amino acids. Preferably, the first 
and second polypeptides together form a member of a 
specific binding pair (sbp) , such as the antigen 
binding site of an immunoglobulin (cuxtibody or antibody 
fragment). Thus, the combination of first and second 
polypeptides may form a polypeptide sbp member which is 
a scFv antibody fragment consisting of a VH domain 
linked to a VL domain by a peptide linker which allows 
the VH and VL domains of the sbp member to associate 
with one another to form em antigen binding site. (Bird 
et al. Science, 242, 423-426, 1988; Huston et al, PNAS 
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USArSS, 5879-5883, 1988) In such a case, the DNA 
construct comprises a first sequence of nucleotides 
encoding a VH or VL domain, a second sequence of 
nucleotides encoding a counterpart VL or VH domain and 
5 a third sequence of nucleotides, between the first and 
second sequences, comprising a heterologous intron. 
Upon transcription of the DNA construct into RNA and 
splicing out of nucleotides of the third sequence, 
nucleotides of the third sequence remaining in the RNA 
10 encode, and are translatable into, the peptide linker 
of the scFv antibody fragment. 

This principle, with nucleotides of the third 
sequence encoding and being translatable into amino 
acids of a linker joining the first and second peptides 
15 or polypeptide chains, may be used for any peptides or 
polypeptides, for example in the creation of peptide 
libraries. 

In preferred embodiments of the present 
invention, the first and second sequences encode 
20 peptides or polypeptides which are not linked in any 
naturally occuring polypeptide. The peptides or 
polypeptides may be derived from the same naturally 
occuring molecule but not linked directly by a peptide 
bond, ie they may be two parts of a polypeptide 
25 natxirally separated by one or more intervening amino 

acids. One or both of the first and second peptides or 
polypeptides may be an antibody fragment, for example 
VH, VL, CH, CL, VH-CH or VL-CL. The peptide or 
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polypfeptide need not be a complete domain. One or both 
of the first and second peptides or polypeptides may be 
encoded by a synthetic nucleotide sequence, eg one 
created randomly. Thus, a random sequence peptide or 
5 polypeptide library may be created for example by 
expression from a repertoire or population of DNA 
constructs, as disclosed, wherein the first and second 
exon sequences conqprise randomly-generated nucleotide 
sequences . 

10 The DNA construct may be transcribable into RNA 

which, following splicing, encodes a "diabody" 
polypeptide, ie a polypeptide comprising a first domain 
which comprises a binding region of an immunoglobulin 
heavy chain variable region and a second domain which 

15 comprises a binding region of an immunoglobulin light 
chain variable region, the domains being linked (eg by 
a peptide bond or peptide linker) but incap^le of 
associating with each other to form an antigen binding 
site. Where the domains are linked by a peptide 

20 linker, the linker may, for instance, be 10 amino acids 
or fewer in length. See Holliger et al, PNAS USA 90: 
6444-6448 (1993) and PCT/US93/02492 . Polypeptides of 
this kind are able to associate with one another to 
form multivalent or multispecif ic binding proteins. 

25 DNA constructs which can be transcribed into RNA which, 
following splicing, encodes such a "diabody" 
polypeptide may, however, be excluded from the present 
invention. 
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* Other examples of first and second peptides or 
polypeptides include any polypeptide comprising binding 
regions of immxinoglobulin heavy and light chain 
variable domains; Va/Vp domains of T cell receptors; T 
5 cell receptor/antibody (fragment) fusions; peptides, 
for example for epitope mapping of an antibody, 
receptor binding peptides, enzyme, eg protease, 
inhibitors; mutagenesis libraries of any multiple 
domain protein, for example nucleotide dehydrogenases 

10 which have nucleotide binding domains and substrate 
binding domains, adhesion molecules such as ICAM-l, 
receptors such as PDGF- receptor which have a ligand 
binding domain and a kinase domain, transcription 
factors which have a DNA binding domain and a second 

15 domain which interacts with a ligand - such as the 
glucocorticoid receptor. For a review of multiple 
domain proteins see Branden and Tooze, "Introduction to 
Protein Structure", Garland 1991. 

The intron may be a self -splicing group I intron 

20 such as ICEIO from Tetrahymena (T.R. Cech Ann. Rev. 

Biochem. 59 543-568, 1990). Splicing out of the intron 
occurs at the RNA level leaving behind sequences at the 
5' and 3' splice sites, which would encode three amino 
acids between the two peptide or polypeptide components 

25 of the product polypeptide. The self -splicing may be 

designed so that the number of amino acids remaining is 
different . 
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•other group I introns or group li self -splicing 
introns may be used. There are at least 149 self- 
splicing group I introns known, including: Tetrahymena 
thermophila rmA intron, Nexirospora crassa cytochrome 
b gene intron 1, Neurospora craesa mitochondrial rRNA, 
Neurospora crassa cytochrome oxidase subunit 1 gene 
oxi3 intron, phage T4 thymidylate synthase intron, 
Clamydomonas reinhardtii 23S rRNA Cr.LSU intron, phage 
T4 nrdB intron, AnaJbaena pre tRNA{Leu) intron. Group 
II self-splicing introns include yeast mitochondrial 
oxi3 gene intron57 and Podospora aneerina cytochrome c 
oxidase I gene. 

Self-splicing introns may be used in combination 
with recombination, for example, at a lox P site, in 
15 the construction of molecules. For example, a lox P 

site may be included in a self -splicing intron between 
the two domains (eg VH and VL) of a polypeptide chain. 
This may, for example, be recombined at the DNA level 
through a lox P site on another replicon carrying 
20 another variable domain gene and the appropriate region 
of a self -splicing intron. Self -splicing at the RNA 
level following transcription will now lead to a 
product polypeptide chain with a new combination of 
first and second polypeptides. 
^5 one aspect of the present invention the third 

sequence of nucleotides in the DNA construct, the 
intron, comprises a sequence for site- specif ic 
recombination. The sequence may be suitable for site- 
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specific recombination in vivo and/or in vitro. It may 
be the lox P site, a 34bp site at which recombination 
is catalysed by the protein Cre (Hoess et al., PNAS USA 
22: 3398-3402, 1982, and Sternberg et al., J. Mxol. 
5 Biol.; 150: 467-486, 1981). The 34bp of the lox P 

site consists of two 13bp inverted repeats separated by 
an 8bp non-symmetrical core (see Figure 4) . 

In order to provide more controlled recombination 
between two sequences leading to the resultant 

10 recombinant vectors desired, each vector may include 

two site -specific recombination sequences each of which 
is different from the other. The sequences should then 
be such that recombination will tcdce place between like 
sequences on different vectors but not between the 

15 different sequences on the same vector. The use of 
site-specific recombination allows first and second 
nucleic acid sequences originally on different (first 
and second) vectors/replicons to be brought together 
onto a single recombinant vector/replicon. 

20 Each of the first vectors and each of the second 

vectors may include a first site-specific recombination 
sequence and a second site-specific recombination 
sequence different from the first, site-specific 
recombination taking place between first site-specific 

25 recombination sequences on different vectors and 

between second site-specific recombination sequences on 
different vectors but not between a first site-specific 
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recomBination sequence and a second site-specific 
recotnbination sequence on the same vector. 

The first site-specific recombination sequence 
may be lox P obtainable from coliphage Pi and the 
5 second site-specific recombination sequence a mutant 
io2£ P sequence, or vice versa. Potentially, both the 
first and second site-specific recombination sequences 
may be mutants, as long as the first sequence will not 
recombine with the each other auid second sequences will 
10 recombine with each other. 

A suitable mutant lox P sequence is lox P 511. 
See Figure 4. 

The first vectors may be phages or phagemids and 
the second vectors plasmids, or the first vectors may 
15 be plasmids and the second vectors phages or phagemids. 
This system (ie employing site-specific 
recombination but not intron splicing) has been used in 
the preparation of antibodies displayed on phage (P. 
Waterhouse et al., Nuc. Acid Research 21: 2265-2266, 
20 1993; and W093/19172) . 

In one embodiment, the recombination is 
intracellular and takes place in a bacterial host which 
replicates the recombinant vector preferentially over 
the first vectors and the second vectors. This may be 
25 used to enrich selection of successful recombination 

events. The intracellular recombination may take place 
in a bacterial host which replicates plasmids 
preferentially over phages or phagemids, or which 



W0 9Sa5388 PCT/GB94A)2662 

14 

replicates phages or phagemids preferentially over 
plasmids. For instance, the bacterial host may be a 
PolA strain of E.coli or of another gram-negative 
bacterium. PolA cells are unable to support 
5 replication of plasmids, but can support replication of 
filamentous phage and phagemids (plasmids containing 
filamentous phage intergenic regions) . So, for 
instance, if the first vectors are plasmids containing 
a first marker gene, and the second vectors are phage 

10 or phagemids containing a second marker gene, selection 
for both markers will yield recombinant vectors which 
are the product of a successful recombination event, 
since recombination transferring the first marker from 
plasmid must take place in order for that marker to be 

15 replicated and expressed. 

The bringing together of nucleic acid for two 
components or subunits of a product polypeptide, 
initially present on two separate replicons enables 
favoxirable combinations of sxibunit genes to be isolated 

20 directly without recourse to extensive recloning, e.g. 
using phage display- This may be achieved by 
recombination between the replicons once they have been 
introduced into the same cell. In a preferred 
configuration, recombination events are effected such 

25 that the genes for one of the component is recombined 

onto a recipient replicon which contains the gene for a 
partner con^onent. Preferably, the recipient replicon 
is capable of being packaged into a bacteriophage 
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particle. Most preferably, the genes encoding one or 
more of the siibunits is fused to a capsid gene such as 
glll in order that the functional multimer can be 
displayed on the surface of the rgdp. 
5 A variety of recombination systems are known, and 

many of these could be harnessed in such as way as to 
effect recombination between replicons. 

One of the most fully understood site-specific 
recombination systems is that used in integration and 
10 excision of bacteriophage lambda (In "Escherichia coli 
and Salmonella typhimurium. Cellular and Molecular 
Biology.- (1987). ppl054-1060. Neidhart, P.C. Editor in 
Chief. American Society for Microbiology) . This 
bacteriophage can follow two developmental pathways 
15 once inside the cell; lysis or lysogeny. The lysogenic 
pathway involves integration of the lambda genome into 
the chromosome of the infected bacterium; integration 
is the result of a site-specific recombination between 
a ca. 24 0bp sequence in the bacteriophage called att P 
20 and a 25bp site in the bacterial chromosome called att 
B. The integration event is catalysed by a host 
encoded factor called IHF and a phage encoded enzyme 
called Int recombinase, which recognises a I5bp region 
common to the two att sites. The integrated DNA is 
25 flanked by sequences derived from att B and att P, and 
these are called att L and att R. The integration 
event is reversible and is catalysed by Int, IHF and a 
second bacteriophage encoded enzyme, Xis. It is 
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envisaged that this system could be used for secjuence 
transfer between replicons within E. coli. For 
example, the donor gene could be flanked by att L amd 
att R sites such that when Int and Xis proteins are 
provided in host cell, recombination between att L and 
att R sites would create a circular DNA segment 
containing the donor gene and a recreated att B site. 
This circular segment could then recombine with an att 
P site engineered into the recipient plasraid. 

For the work described in this application, the 
lox P/Cre system was chosen of the possibilities 
available because the recombination is highly sequence- 
specific, very efficient and occurs at a short target 
site that is readily incorporated into cloning vectors. 
However, other site-specific recombination systems may 
be used, for instance: flp recombinase (A. Landy, 
Curr. Opinion Genetics Devel. 3 699-707, 1993). 

A way of enriching for productive recombination 
events is to employ mutant sites. Several mutemts of 
the lox P sequence are known, and these are compromised 
with respect to their ability to recombine with each 
other and the wild-type lox P sequence (Hoess, R.H., 
Wierzbicki, A. and Abremski, K. (1986) Nucl. Acids Res. 
14, 2287-2300) . For example, lox P 511 has a G->A 
point mutation in the central 8bp segment, with the 
result that it will only recombine with other lox P 511 
sites, but not the wild- type lox P sequence (Hoess, 
R.H. Wierzbicki, A. and Abremski, K. (1986) et supra.). 
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Placefoent of wild-type and mutant lox P sequence 
combinations can direct which recombination events are 
possible. The sites loxPl, loxP2, loxP3 and loxP4 
(Figure 4) can be used in a similar way to loxPBll. 
5 These sites do not recombine significantly with 
loxPSll. There is in some cases a degree of 
recombination between the loxPWT site and these mutant 
sites, derived from it. For instance, in one 
experiment 5% recombination was observedbetween loxP3 

10 and loxPWT sites. All of these new loxP sites 

recombine efficiently with identical sites, ie like 
sites, eg one loxP4 site with another loxP4 site, and 
show strong preference for this over recombination with 
a different site. 

15 Provision of further different mutant loxP sites 

permits even greater control over the occurrence of 
recombination events leading to more complex, 
controllable and efficient recombination strategies 
being possible. The availability of these loxP sites 

20 has allowed the construction of a vector system 

including 3 loxP sites as in Example 6. This 31oxP 
system offers two additional features compared with the 
systems containing two loxP sites: 

(a) It should facilitate chain shuffling of light 

25 and heavy chain genes for affinity maturation of 
antibody fragments (see Marks et al (1992) , 
Bio/Technology 10, 779-783) since one variable domain 
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may be kept cons taint and a library of VH and VL genes 
recombined with it using an appropriate donor vector. 

For example, a clone specific for an antigen may 
be isolated where the gene for a VH domain of a scFv 
5 fragment is located between loxPSll and loxP wt of a 
vector containing 3 loxP sites, such as fd31ox. A 
library of VL domains may then be shuffled with the VH 
domain gene kept constant by recorabining the clon in 
the 3 loxP site vector with a library of VL genes on a 

10 donor vector such as pUC19 which are located between 
the IoxP4 site and the loxP 511 site. The library of 
VL domain genes is now encoded in the 3 lox site vector 
and scFv fragments, eg with improved affinity, may be 
selected from the phage displayed scPv fragment 

15 repertoire. 

Although chain shuffling may be performed in 
2loxP systems, this 31oxP system gives more 
flexibility, particularly to the nature of the 
replicon, phage or plasraid, where the reshuffled 

20 repertoire is expressed, since both repertoires are 
flanked by loxP sites. 

Example 6 and Figure 13 show the use of a loxP 
system in model experiments for the construction of a 
diabody or single chain Fv repertoire where the VH and 

25 VL genes are separated by a self -splicing intron 

containing a loxP site. The design of the system will 
faciliate chain shuffling as above. 
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(b) It facilitates the transfer of light and 
heavy chain gene pairs which have been selected on the 
surface of filamentous bacteriophage for binding to 
antigen into a soluble expression vector for expression 
5 of e.g. soluble scFv fragments, which at present needs 
to be done by cloning using restriction enzymes. The 
transfer by recombination coxxld be achieved by creating 
an expression vector containing a new mutant loxP site 
such as loxP4 and the WT site and by recombination 
between these two sites and the corresponding sites on 
the fd31ox vector. Model experiments for this are 
described in example 6 and Figure 13 . 

The use of three different loxP sites also 
allows, for example, the recombination of three 
15 sequences in order. One sequence to be recombined 

could be flanked by loxP and loxPSll, a second sequence 
by 10XP511 and loxP3. These sequences may then be 
recombined into a third replicon containing a third DNA 
sequence and three loxP sites. The location of 2 loxP 
20 sites within different self splicing introns allows the 
three sequences to be expressed continuously as shown 
in Figures 7 and 8. 

Selection of productive arrangements may be 
facilitated by use of a polA strain of bacteria, 
25 preferably B.coli or other gram negative bacterium. 

These cells are deficient in DNA polymerase I and are 
unable to support replication of plasmids (Johnston, S. 
and R, D.S. 1984, supra.). However, they are able to 
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suppolrt replication of filamentous phage and plasmids 
containing filamentous phage intergenic regions. If 
Cre-catalysed recombination is performed in polA 
bacteria, by selecting for the presence of both 
selectable markers in the same polA cell successful 
recombination events are enriched, since recombination 
must take place for the second marker gene to be 
replicated and expressed. The resulting cells then 
contain the complete repertoire and can be propagated 
as cells and infected with helper. phage to produced 
phagemids containing the genes for both chains and 
expressing them on their surface. 

The invention also provides a vector comprising a 
DNA construct as disclosed. Generally, the vector 
comprises nucleic acid necessary for expression. The 
vector may comprise nucleic acid for secretion of the 
product' polypeptide upon expression. 

The present invention also provides a method of 
producing a polypeptide product which comprises a 
combination of a first peptide or polypeptide component 
and a second peptide or polypeptide component, the 
method comprising: 

providing a DNA construct comprising a first 
sequence of nucleotides encoding a first peptide or 
25 polypeptide, a second sequence of nucleotides encoding 
a second peptide or polypeptide and a third sequence of 
nucleotides between the first and second sequences 
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encoding a heterologous intron with a site-specific 
recombination sequence within the intron; 

transcribing DNA of the construct into RNA; 

causing or allowing splicing of nucleotides of 
5 the third sequence to produce an RNA molecule encoding 
the polypeptide product; 

translating the RNA molecule into the polypeptide 
product . 

The transcription, splicing and translation steps 
may take place in in vitro or in vivo systems. 
Conveniently, and particularly preferably for the 
construction of repertoires, these steps are performed 
in vivo, eg in E. coli. Splicing may also be 
accomplished, less preferably, using in introns which 
are not self -splicing, by introducing the components of 
the splicing apparatus of eukaryotic cells, which 
promote splicing (J. A. Wise Science 262 1978-1979, 
1993; A.J. Lamond, BioBsaays 15 595-603, 1993), into eg 
B. coli . 



20 



The DNA construct provided may be any as 
discussed above. Suitable vectors for expression 
(transcription) can be chosen or constructed, 
containing appropriate regulatory sequences, including 
promoter sequences, terminator fragments, 
25 polyadenylation sequences, enhancer sequences, marker 
genes and other sequences as appropriate, as is well 
known to those skilled in the art. For further details 
see, for example, Molecular Cloning.- a Laboratory 
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Manual: 2nd edition, Sambrook et al, 1989, Cold Spring 
Harbor Laboratory Press. Transformation procedures 
depend on the host used, but are well known. 

Preferably, a phage or phageraid vector is used 
5 and the vector, with the DNA construct, packaged into a 
bacteriophage particle. Advantageously, the- 
polypeptide product comprises a domain which is a 
surface component of an organism such as a 
bacteriophage, for example a filamentous bacteriophage 
10 such as fd or M13. Preferably, the surface component 
is GUI of bacteriophage fd or the equivalent from 
another filamentous 'phage. Suitable technology is 
described in WO92/01047, WO92/20791, W093/06213, 
W093/11236, W093/19172 and PCT/GB93/02492 . Thus, the 
15 provided DNA construct is packaged into a particle 

which displays on its surface the polypeptide product 
of expression from the construct, including the 
splicing step. In this way, polypeptide product with 
binding affinity or enzymatic eg catalytic affinity for 
20 a target can be extracted from medium or selected from 
a mixture of different polypeptide products without 
such binding affinity or enzymatic activity, by contact 
with target eg using a chromatographic technique. 
Where the polypeptide product is a sbp member, 
25 selection may be on the basis of binding affinity for 

complementary sbp member: eg an immunoglobulin binding 
domain (such as scFv fragment) can be selected on the 
basis of binding affinity for antigen. 
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• The step of provision of a DNA construct may 
actually involve the provision of a plurality, eg a 
repertoire, of constructs with different nucleic acid 
sequences. The term -repertoire- is used to indicate 
i genetic diversity, ie variety in nucleotide sequence, 
and generally implies a large number of different 
sequences, perhaps of the order of millions (eg 10'-10»- 
10»-lO") . Highly diverse repertoires may be created 
when a sequence for site-specific recombination, (as 
discussed, eg lox P wild-type or mutant) , is included 
within the third sequence in the DNA construct at a 
site compatible with splicing upon transcription. The 
size of a library generated by recombination between 
one library and another is limited only by the 
transfection efficiency, m principle, if each library 
contains, lo' clones, each recombination may introduce a 
further level of diversity of lO', thus recombination 
between a first repertoire encoding 10' different VH 
domains with a second repertoire encoding lo' different 
VL domains yields a recombinant repertoire encoding lo" 
different product polypeptides. Similarly, two 
libraries of lo' clones can be recombined to give a 
library of 10* clones. 

For example, a first repertoire of replicons 
comprising nucleic acid encoding a repertoire of first 
peptide or polypeptide component may contain part of a 
self -splicing intron, while a second repertoire of 
replicons comprising nucleic acid encoding a repertoire 
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Of second peptide or polypeptide component contains a 
con^lement part of the self -splicing intron. The 
replicons in each of the first and second repertoires 
of replicons each comprise a sequence for site-specific 
5 recombination, suitably positioned such that 

recombination of a replicon from the first repertoire 
of replicons with a replicon from the second repertoire 
of replicons results in formation of the self -splicing 
intron in the resultant recombinant replicon. 

10 Alternatively, replicons in either or both of the first 
and second repertoires may contain a complete self- 
splicing intron. 

The first and second repertoires of replicons may 
be recombined ("crossed"), eg at a site-specific 

15 recombination sequence, to produce a third repertoire 
of (recombinant) replicons which includes nucleic acid 
encoding a plurality of different combinations of first 
and second peptide or polypeptide component, with a 
self- splicing intron between the nucleic acid encoding 

20 the first and second peptide or polypeptide components 
on each replicon. The recombination may take place in 
vivo in bacterial host cells following transfection 
with the first repertoire of replicons and subsequent 
transfection with the second repertoire of replicons. 

25 If the sequences for site-specific recombination are 
lox P, the recombination may be catalysed by Cre 
recombinase . 
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Transcription of nucleic acid in the third 
repertoire of replicons into RNA is followed by 
splicing out of the intron containing the sequence for 
site-specific recombination, leaving mRNA coding for 
5 polypeptide product which can be translated into the 

polypeptide product. The production of a repertoire of 
polypeptide products coti^rising different combinations 
of first and second peptide or polypeptide components . 
may be followed by a step of selection of products of 

10 interest, such as those with a particular binding 
specificity or enzymatic activity. 

Each replicon in the third repertoire of 
replicons may comprise a sequence enabling packaging of 
the replicon into a bacteriophage particle, and the 

15 polypeptide product may comprise a surface component of 
a bacteriophage, as discussed. Then, particles may be 
selected from a repertoire of particles by their 
display of polypeptide product with a binding 
specificity or enzymatic activity of interest. Each 

20 selected particle then contains DNA encoding that 
polypeptide product. 

Figure 5. demonstrates the principle for use in 
production of a scFv repertoire. There the "first 
polypeptide component" of the polypeptide product is a 

25 VH domain and the ^second polypeptide component" of the 
polypeptide product is a VL domain. A lox P site is 
included within a Class I self -splicing intron. The 
peptide linker of each scFv fragment in the product 
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repertoire ie formed, at least in part, by remnants of 
the splice sites left after splicing out of the intron 
between the VH and VL domains upon transcription. 
Instead of using two repertoires in the 
i generation of a recombinant repertoire for expression, 
a single first or second peptide or polypeptide 
con?)onent may be "chain shuffled" against a repertoire 
of corresponding second or first peptide or polypeptide 
component. Thus, in the generation of a repertoire of 
scPv fragments to be used in selection for a scFv 
fragment able to bind to an antigen of interest, either 
a VH or a VL domain known to be able (with 
complementary VL or VH domain) to bind to the antigen 
may be combined with a repertoire of complementary VL 
15 or VH domain to produce a repertoire for expression 

followed by selection on the antigen for pairings able 
to bind, 

A further aspect of the present invention 
provides nucleic acid comprising a sequence of 

20 nucleotides encoding self -splicing intron with a site- 
specific recombination sequence, such as a loxP site or 
a mutant or derivative thereof, within the intron. 
Preferably such nucleic acid consists essentially of a 
sequence of nucleotides encoding self- splicing intron 

25 with a site-specific recombination sequence within the 
intron. Such nucleic acid may be isolated and is 
suitable for use in creation of constructs for use in a 
method as herein disclosed. Preferably, the nucleic 
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acid comprises restriction sites flanking the intron. 
for ligation of nucleic acid encoding or peptides. The 
nucleic acid may be incorporated in a vector operably 
linked, ie under the control of, a promoter for 
5 expression. Other preferred features are as disclosed 
herein with reference to the methods and the DNA 
constructs. In particular, the site-specific 
recombination sequence within the intron is preferably 
heterologous, as discussed. 

10 SELF-SPLICING TO FORM DIABODIES OR SINGLE CHAIN Fv 
FRAC^SENTS 

A recombination site (eg. lox P) may be included 
in a self -splicing intron between the two antibody 
domains of the polypeptide chain. This may, for 

15 example, be recombined at the DNA level through a lox P 
site on another replicon carrying another variable 
domain gene and the appropriate region of a self- 
splicing intron. Self -splicing at the RNA level 
following transcription will now lead to a diabody 

20 polypeptide chain with a new combination of variable 

domains or a single chain Fv polypeptide, depending on 
the length of the linker region encoded. In 
PCT/GB93/02492 the splicing of an intron from RNA 
encoding a diabody polypeptide is described. This can 

25 readily be extended to single chain Fv fragments by 

introducing the sequence encoding the extra amino acids 
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on either side of the RNA splice sites encoding the 
appropriate length of linker. 

Chain shuffling can be performed for bivalent or 
bispecific diabodies or for single chain Fv fragments 
5 using the systems described in Figxires 3 and 5. As 
noted above, a further level of control may be 
established by the use of a system with 3 loxP sites, 
as shown in Figure 13. The expression of diabody and 
single chain Fv molecules from clones containing loxP 

10 sites within self splicing introns is demonstrated in 
examples 1, 2 and 4. Example 3 demonstrates the 
feasibility of making a large library which recombines 
two exons into a longer continuous sequence. This 
methodology for making a repertoire can be applied to 

15 other molecules such as single chain Fv fragments and 

dicibodies where the VH and VL genes replace the peptide 
sequences. Example 6 describes model experiments which 
demonstrate that recombination can be performed between 
loxP sites configured for the construction of diabody 

20 or single chain Fv repertoires. It is concluded that 
this methodology is suitadsle for the libraries 
described in example 3 and Griffiths et al (1994, 
supra) and that libraries of more than 10" independent 
scFv or diabody clones are feasible. 

25 As discussed further herein, introns with splice 

sites, such as self -splicing introns, containing an 
internal lox P site may be applied to any other system 
where two functional domains come together, for 
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instance T cell receptors or two domain proteins. In 
addition to proteins with natixral variants such as 
antibodies, for any two domain proteins mutagenesis 
libraries can be made for the two domains and then 
5 combined using the lox P system. 

In addition to splicing together libraries of 
domains, such as VH and VL domains, parts of domains 
may be spliced together, eg using a self-splicing 
intron. For instance, the use of a self-splicing 
10 intron containing a recombination site such as lox P in 
framework 3 of V domains allows recombination of 
fragments containing CDRs 1 and 2 with fragments 
containing CDR3, eg in CDR3 shuffling. 

SPLICING INTRONS/RECC^INATION IN THE CONSTRUCTION OF 

15 PEPTIDE LIBRARIES 

Libraries may be made where two sequence encoding 
peptides are encoded separated by a self -splicing 
intron containing a recombination, eg lox P, site. For 
instance, two separate libraries of ten amino acid 

20 peptides can be cloned and then recombined via the lox 
P 511 and lox P sites as is shown in Figure 6, The 
amino acids encoded by the region of the 5' and 3' 
splice sites make this into a total 25 amino acid 
peptide with 5 constant amino acids in the centre. The 

25 peptide library can then be used for a number of 

purposes, for instance the epitope mapping of antibody 
binding sites or to derive new molecules such as 
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receptor binding proteins, protease inhibitors or 
substrates . 

Example 3 shows that a large phage display 
library of ca. 5 x 10" recombined 25 amino acid 
5 peptides may be constructed using recombination between 
loxP sites contained in a self splicing intron and 
peptides containing the epitope recognised by an anti- 
p53 antibody selected. Constrained peptide libraries 
may be made by incorporating a cysteine residue in each 

10 of the 10 amino acid peptides to be recombined so* that 
a disulphide bond is formed and the peptides between 
the cysteine would form a loop. The five amino acid 
linker may be varied in length and amino acid sequence 
by varying the 5' and 3' splice sites and the reading 

15 frame. The number of random amino acids may also be 
varied and need not be the same on either side of the 
linker. This example demonstrates the feasibility of 
making a large library which recombines two exons into 
a longer continuous sequence. 

20 USE OF TWO OR MORE INTRONS IN CONSTRUCTION OF 
RECOMBINANT ANTIBODIES 

Two or more splicing introns may be used to link 
together three or more nucleic acid sequences encoding 
polypeptides. This may be particularly advantageous in 

25 constructing libraries where V-D-J recombination (for 

the antibody heavy chain) occurs in E.coli. The use of 
site-specific recombination sequences (e.g. lox P) 
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within the introns (e.g. using the scheme in Figure 7) 
allows this V-D-J recombination of VH domains to occur 
in E.coli in the presence of recombinase (Cre for lox 
P) . The VH, DH and JH regions may be natural V, D and 
5 J genomic segments regions or derived from synthetic 
oligonucleotide sequences, perhaps of different 
lengths, especially for the D region, so that the range 
of CDR3 lenghts generated by the recombination may 
reflect the same (or a modified) distribution of 
10 natural CDR3 lengths and the presence or absence of N 
base addition. Figure 7 shows the use of lox P to 
achieve V-D-J recombination to obtain a single chain Fv 

molecule and Figure 8 shows the e3q)ression of this 

molecule. The introns and splice donor and acceptor 
15 sites need to be designed to ensure that splicing does 

not cut out the exon sited between the two introns. 

The introduction of a fourth intron containing a 

different recombination site would allow the linking of 

different CHI domains to the J region. 
20 An analogous system may be used for T cell 

receptors a similar system may be used for reshuffling 

V and J regions of light chains. 

SELECTION OF SEQUENCES FOR THE 5' AND 3' SPLICE SITES 
When an intron is deleted by a self-splicing 
25 process, a residue of the intron is left behind within 
the coding region of the polypeptide, due to the 5' and 
3' splice sites. Example 1 shows two different amino 
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acid sequences incorporated into a diabody due to this 
residue of the intron, with variation in expression 
occurring. There are likely to be differences in the 
expression of a number of proteins depending on the 
5 nature of the PI and PIO sequences. Therefore, there 
may be a need in certain cases to identify amino acids 
which are compatible with successful splicing of the 
intron and expression of protein. 

Identification of suitable amino acids 

10 incorporated due to the bases at the 5 'and 3' splice 
sites may be done by mutating bases (eg randomly) in 
the region of the internal guiding sequence with 
complementary bases which form the PI hairpin loop of 
the intron. If the intron is now inserted, between the 

15 nucleic acid encoding the first and second peptides or 
polypeptides, for instance between the VH and VL 
domains of antibody fragment, such that efficiently 
spliced polypeptide product is produced and may be 
displayed on phage and selected by binding to target, 

20 those sequences compatible with efficient splicing can 
be selected. Similarly, sequences of the 3' splice 
site can be varied together with those of the internal 
guiding sequence and those which are efficiently 
spliced selected by the expression of the polypeptide 

25 sequence. 

The above procedures apply when the bases of the 
internal guiding sequence that are to be changed only 
participate in one of the PI and PIO hairpin loops. It 
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can be seen from Figxire 1 that the central bases of the 
internal guiding sequences participate in both the PI 
and PIO hairpin loops. Thus for these bases it is 
necessary to mutate the bases of both the 5' and 3' 
5 splice sites as well as the internal guiding sequence 
in order to maintain corapleraentarily and self splicing. 

Example 4 shows that mutations may be made at the 
3' splice site and internal guiding sequence of the 
self splicing intron to allow the encoding of amino- 
10 acids compatible with higher expression, after self 
splicing of RNA, of both diabody and single chain Fv 
antibody fragments. This directed mutation procedure 
may be applicable to other sites of the self splicing 
intron . 

15 When repertoires are to be made, the GLSSG 

sequence used in Example 1 may be used as the first 
trial sequence for the sequence linking the ' two 
polypeptides following splicing out of the intron. 
Further sequences identified, eg using a mutation 

20 process as described in Example 4, may be used as 
alternatives . 

To select the sequences of the splice site at the 
5' end of the exon which are retained in the mature 
protein after splicing of the pre-mRNA that are 

25 compatible with self -splicing, the sequences of known 
self -splicing introns may be examined (F. Michel and 
E. Westhof J. Mol. Biol, 216 581-606, 1990; F. Lisacek 
et al J. Mol. Biol. 235 1206-1217, 1994). Sequences 
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incorporation of favourable amino acids may then be 
chosen . 

CONTROL OF SELF- SPLICING USING STREPTOMYCIN 
5 Streptomycin prevents self -splicing. Thus the 

use of streptomycin in Str-R E, coli will prevent 
splicing occurring in transcribed RNA. The removal of 
streptomycin will aloow the generation of a spliced RNA 
product, leading to, on translation, a protein product 

10 which is only generated on splicing. Thus, one could 
have a cloned gene which does not express an active 
protein in the presence of streptomycin in the growth 
medium, but does so in its absence. This may be useful 
for expressing proteins which are toxic or reduce 

15 growth in E. coli, for example antibodies directed 
against E. coli proteins or inhibitors of E.. coli 
enzymes, where expression of the toxic protein can be 
switched off until required. 



The present invention will now be illustrated 
20 further by way of example. Modifications and 

variations within the scope of the present invention 
will be apparent to those skilled in the art. 

All documents mentioned in the text are 
incorporated herein by reference. 
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Figure 1 shows a schematic of a self -splicing 
intron, including the Pi and PIO helices and the 
internal guiding sequence. The splice sites are marked 
by arrows. 

Figure 2 illustrates the expression of a single 
chain Pv or diabody polypeptide from DNA containing a 
self splicing intron. The sequences flanking the self 
splicing intron will determine the length of the 
peptide linker. Ribosorae binding sites are indicated 
by open circles, Lg3 is the leader sequence for phage 
fd gene III. 

Figure 3 illustrates chain shuffling of a diabody 
(or a single chain Fv) molecule. It shows the 
replicons generated by Cre-mediated recombination 
between the acceptor phage vector fdDOG-2dialoxsplice 

(A) and the donor plasmid vector pUC19-2dialoxsplice 

(B) . A is based on fd-tet-DOGl, with the chain VHA-VLB 
in one cistron under control of the gene III promoter. 
Between VHA and VLB is inserted the self- splicing 

20 intron from retrah>Tnena containing the lox P 51l 

recombination site inserted at a site compatible with 
self-splicing activity. B is based on pUC19 and 
contains lox P 511, the distal part of the self- 
splicing intron from Terahymena, VIA, and the lox P 

25 wild type sequence in the same arrangement as A. 

Within E. coli an equilibrium between the six replicons 
develops due to the reversible nature of recombination 
in th lox-Cre system. The same scheme will apply to 
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both single chain Pv and diabody molecules, depending 
on the length of the linker peptide between the 
variable domains. Product E would express fd phage 
displaying a single chain Pv or a diabody depending on 
5 the linker length used. 

A and B can cointegrate by recombination between 
either mutant or wild-type loxP sites to create 
chiraaeric plasmids C and D respectively. Further 
recombination can then occvir between the two wild- type 
10 or the two mutant loxP sites, to generate the original 
vectors {A and B) or two new vectors {E and F) . The 
light chains of A and B are therefore exchanged, and 
product E now encodes fd phage displaying a single 
chain Pv or a diabody depending on the linker length 
15 used. Product F contains the VL originally in A. 

Within E. coll an equilibrium between the six replicons 
develops due to the reversibel nature of recombination 
in the lox-Cre system. 

Figure 4 shows the sequence of wild type and 
20 mutant lox P sites. 

Figure 5 illustrates the generation of a single 
chain Fv repertoire by recombination between 
repertoires of VH and VL domains. 

Figure 6 illustrates the generation of a peptide 
25 library by recombination between two replicons (a) 

'pUC19-PEP and (b) fdDOG-PEP. rbs represents ribosome 
binding sites; LpelB is the leader peptide sequence; 
glll is fd phage gene III; lOaa is a random 
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oligonucleotide (NNK)io encoding ten amino acid residues 
(K is an equimolar mixture of G and T) ; * is cm ochre 
stop codon. The expressed secjuence is: 
aal-aa2-aa3-aa4-aa5-aa6"aa7-aa8-aa9-aal0-A-L-L-R-Y- 
5 aall-aal2-aal3-aal4-aal5-aal6-aal7-aal8-aal9-aa20. 

Figure 7 illustrates the recombination of V, D 
and J regions using recombination between lox P sites 
within self splicing introns. The VH, DH and JH regions 
may be natural VH, DH and JH regions or derived from 

10 synthetic oligonucleotides sequences, perhaps of 

different lengths, especially for the D region, so that 
the range of CDR3 lengths generated by the 
recombination, reflects the same (or a modified)- 
distribution of natural CDR3 lengths. The scheme is 

15 shown for a single chain Pv molecule with the VL domain 
fused to gene III protein. 1x1, 1x2 and 1x3 are 3 
different lox P sites e.g. wild type lox P, lox P511 
and lox P3. in2 and in3 are the two introns which 
contain 1x2 and 1x3 sites such as the Tetrahymena rRNA 

20 and the T4 sunY intron. (a) Acceptor vector; (b) donor 
vector 1; (c) donor vector 2; (d) recombined fd phage. 

Figure 8 shows the transcription, splicing and 
expression of a single chain Fv molecule constructed as 
in Figure 7, containing recombined V, D and J 

25 regions, fused to gene III protein. The nucleic acid 
regions encoding the amino acids of the final product 
are shown as Expressed scFv-gene III fusion. (a) DNA; 



wo 95/15388 



PCT/GB94A)2662 



38 

(b) Primary transcript; (c) Spliced transcript; (d) 

Expressed scFv-genelll fusion. 

Figure 9 shows an alternative final product from 

recombination which mimics VH, DH and JH recombination 
5 in vitro to generate a new VH domain. Two separate 

libraries of sequences of random nucleotides (x and y) 

which encode 0 to 15 amino acids are made and 

recombined using the lox/Cre system. 1x1 and 1x2 are . 

two distinct lox sites such as lox P5 511 and lox P 
10^ (wild type). This scheme requires only one self- 

splicing intron and two different lox P sequences. 

Figure 10 shows the construction of the vector 

fdDOG-PEP. (a) pUC19 NQIO K; (b) fd DOG-FCK; (c) 

INTRON_LoxP(wt) ; (d) fdDOG-PEP; r.b.s. - ribosorae 
15 binding sites; LpelB - leader peptide sequence; glll - 

fd phage gene III (glll) ; 10 aa - random 

oligonucleotide; * - OCHRE stop codon. 

Figure 11 shows the construction of the vector 

PUC19-PEP. (a) pUC19 NQIO K; (b) fd DOG-BLX; (c) 
20 INTRON_LoxP(wt) ; (d) fdDOG-PEP; r.b.s. - ribosome 

binding sites; LpelB - leader peptide sequence; glll - 

fd phage gene III (glll) ; 10 aa - random 

oligonucleotide; * - OCHRE stop codon. 

Figure 12 shows construct fdDWT/3 and three 
25 different linkers formed on expression from constructs 

described in example 4. Sequence A is derived from the 

unmutated self splicing intron. Sequence B is derived 

from the self splicing intron mutated at the 3' splice 
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site and in the internal guiding sequence. Sequence C 
shows the sequence derived from the single chain Fv 
fragment. Bases contributing to the PI and PIO hairpin 
loops are underlined. Restriction site bases are 
5 outlined. The diagonal slashed line shows the bases 
between which the self -splicing intron is spliced out. 
T7 is the promoter for T7 RNA polymerase. Fx is a site 
• for Factor X protease. Part D shows the schematic of 
the self splicing intron highlighting the bases which 

10 are mutated (G to C in the PIO hairpin loop and its 

complementary base in the internal guiding sequence) . 

Figure 13 A shows the fd phage acceptor vector, 
fdDWT/4 containing 3 lox sites is shown. It contains 
the VH and VL genes of the anti-NIP clone G6 {Griffiths 

15 et al, 1994 supra). The sites loxPSll and loxPWT flank 
the VH gene and the sites loxPWT and loxP4 flank the VL 
gene. The loxPWT site is in the self splicing intron 
and the loxP4 site sits between the VL gene and gene 
III. The diabody or single chain Fv polypeptide chain 

20 encoded is expressed as a fusion with the gene III 

protein, A site for the factor X protease is included 
between the VL gene and gene III to allow the 
possibility of the elution by proteolysis of phage from 
the antigen during selection procedures. Alternative 

25 versions of fdDWT/4 were also made with the site loxP4 
replaced with loxP3 cuid loxPl respectively. The donor 
vector PDN8 contains the VH-DIO gene flanked by loxPSll 
and loxPWT sites. The donor vector pRWT/4 contains the 
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VL-DIO gene flanked by loxPWT and loxP4 sites. In the 
donor vectors pRWT/3 or pWT/l the loxP4 site of pRWT/4 
is replaced by the loxP3 or loxPl site respectively. 
The expression vector pEX511/4 contains the S12 gene, 
5 which confers streptomycin sensitivity on bacteria, 
flanked by loxPSll and loxP4 sites. 

Figure 13B sunmarises the recombination 
efficiencies obtained in the experiments described in 
example 6. The left hsmd loxP site is loxPSll, the 
10 middle loxP site is the loxP site within the self 

splicing intron and the right hand loxP site is the 
loxP site between the VL gene and gene III. 



EXAMPLE 1: USE OF SELF -SPLICING INTRONS IN THE 
CONSTRUCTION OF DIABODY MOLECULES 

In the work described in this example, a self 
splicing intron was introduced between the VH and VL 
domain genes of two antibodies cloned in the diabody 
format, NQll and D1.3 directed against 2-phenyloxazol- 
5-one and hen egg lysozyme respectively. This self 
splicing intron was shown to be spliced out following 
expression, as determined by the expression of 
functional bivalent diabodies. 



25 



Construction of NOll and D l.3 clnn^s containing a g^ ,7,,f 
^plicincr intron. excised to leave a five amino am'H 
linker between VH and VL do mains of bivalent diahodif>f^ 
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« 

The self-splicing intron from Tetrahymena (T.R. 
Cech Ann. Rev. Biochera. 59 543-568, 1990) has been 
shown to be able to splice in the E.coli cytoplasm. 
Such a self-splicing intron, from clone ICEIO (Ian 
5 Eperon, University of Leicester) was inserted between 
the genes encoding the VH and VL domains of the 
antibodies Dl.3 and NQll in such a way as to create 
upon splicing out an open reading frame encoding a 
diabody with linker VH-GLSSG-VL. Without splicing no 
10 functional diabody can be produced as the self splicing 
intron contains several stop codons din 3 reading 
frames . 

A restriction site for BstEII was incorporated at 
the 5' end of the primer TlbaBstEII and a Sad 

15 restriction site introduced in the primer TlfoSac. 

This allowed the self splicing intron fragment to be 
cloned in a 2 -way ligation reaction into the expression 
vectors pUC119D1.3 (encoding the V domains of the Dl.3 
anti-lysozyme antibody) or pUClSNQll (encoding the V 

20 domains of the anti-phOx antibody NQll) each cut with 
BstEII and Sad . 

TlbaBstEII primes at the 5' end of the self 
splicing intron and conserves the internal guidance 
sequence (IGS) required for splicing activity and 

25 inserts a extra glycine residue at the 3' end of the VH 
domain. TlfoSac primes at the 3' end of the self 
splicing intron and conserves the thymidine base just 
3' of the self splicing intron which, though not part 
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of the intron, is present in Tetrahymena DNA. TlfoSac 
inserts a extra Gly and Ser residue at the 5' end of 
the VL creating a 5 amino acid linker. 

The self splicing intron was amplified with the 
5 primers TlbaBstEII and TlfoSacI using standard 

conditions (see eg exanple 14 of PCT/GB93/02492) . The 
product of the PGR reaction was digested with 
restriction enzymes Sad and BstEII and ligated into 
BstEII/SacI digested pUCll9Dl,3 or pUClSNQll in a 

10 molar ratio 4:1 {SSI:pUCll9D1.3 or pUC19NQll) and the 
resulting ligation mixes used to transform E. coli TGI 
cells. Recombinants were screened for inserts of 
correct size using primers specific for self splicing 
intron, TlfoSac and TlbaBstEII. 

15 Soluble diabody was expressed by , growth at 37'C. 

Cells in log phase growth in 2 mL 2YT/0-l% glucose/100 
;xg mL** ampicillin were induced by adding IPTG to a 
final concentration of ImM IPTG and grown 3 hours 22^*0. 
The cells were centrifuged (lOOOg 10 minutes) and the 

20 cell pellet resuspended in 100^1 ice cold PBS/lmM EDTA 
and left on ice, 60 minutes. The cell suspension was 
centrifuged (lOOOg for 10 minutes) and the diabody- 
containing supernatant used in ELISA on lysozyme and 
phOx (as described in example 1 of PCT/GB93/02492) . 

25 The ELISA signal (absorbance at 405nm) was 

equivalent (greater than 1.0 after 10 min) for the 
spliced 5 amino acid linker D1.3 diabody to that 
obtained with the 5 amino acid linker D1.3 diabody 
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(constructed in example 1 of PCT/GB93/02492) . However 
for the spliced 5 amino acid linker NQll diabody the 
signal was much lower (0.2 compared to 2.0 after 20 
rain) when con?>ared to the 5 amino acid linker diabody 
constructed in example l of PCT/GB93/02492 . There 
three possible explanations for this: 

- the NQll diabody is not functional with the GLSSG 
linker sequence, although this appears unlikely; 

- self- splicing does not work properly in the case, of 
the diabody NQll because the DNA sequence 3' of the 
intron (at the 5' of the VL domain) is not suitable for 
self splicing. Whereas the D1.3 sequence at 5' end of 
the VL domain gene is efficient at allowing self 
splicing, the NQll sequence in this region is poor; 

- there is a cryptic splice site in this construct. 



splicincT intron incluHir,rr a loxP exeis^rf ^o T^ ^.r^ 

^ six amino a^jd linker befcw^Pn yfj and VT. H^maint, of 
bivalent diabodiefft 

20 The primers Tlba2BstEII and Tlfo2SacI were 

designed to introduce into the NQll construct sequences 
3' of the self splicing intron which should enable 
efficient self splicing at the RNA level. 

The self splicing intron was amplified with 

25 Tlba2BstEII and Tlfo2SacI by PGR. This intron was 

inserted between the VH and VL domain genes of antibody 
NQll and creates upon splicing out an open reading 
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frame encoding a diabody with linker VH-GSLKVG-VL. 
Without splicing no functional diabody can be produced 
as the self splicing intron contains several stop 
codons in 3 reading frames. 
5 A restriction site for BstEII was incorporated at 

the 5' end of the primer Tlba2BstEII and a Sad 
restriction site introduced in the primer Tlfo2Sac. 
This allowed the self splicing intron fragment to be 
cloned in a 2 -way ligation reaction into the expression 

10 vector pUClSNQll cut with BstEII and Sad. Tlba2BstEII 
primes at the 5' end of the self splicing intron and 
conserves the bases at the 5' splice site which pairs 
with the internal guidance sequence (IGS) required for 
splicing activity and inserts a extra glycine residue 

15 at the 3' end of the VH. Tlfo2Sac primes at the 3' end 
of the self splicing intron and conserves the thymidine 
base just 3' of the self splicing intron which, though 
not part of the intron, is present in Tetrahymena DNA 
and inserts a extra Gly and Ser residue at the N- 

20 terminal end of the VL domain. 

The self splicing intron used in this case 
contained a lox P site inserted between bp 236 and 237. 
It was amplified with the primers Tlba2BstEIl and 
Tlfo2SacI using standard conditions. The product of 

25 the PGR reaction was digested with restriction enzymes 
Sad and BstEII and ligated into BstEII/SacI digested 
pUClSNQll in a molar ratio 4:1 (SSI :pUC19NQll) and the 
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resulting ligation mix used to transform E. coli TGI 
cells. 

Recombinants were screened for inserts of correct 
size using the primers specific for self splicing 
5 intron, Tlfo2Sac and Tlba2BstEII. 

Soluble diabody was expressed as above and 
assayed by ELISA. In this case an equivalent signal 
(greater than 1.0 after 10 min) was obtained with the 6 
amino acid linker NQll diabody formed by self splicing 
10 as for the 5 amino acid linker diabody constructed in 

example 1 of PCT/GB93/02492 . Thus this strategy allows 
more efficient self splicing in the NQll construct. 

EXAMPLE 2: USE OF SELF-SPLICING INTRONS IN THE 

CONSTRUCTION OF A SINGLE CHAIN FV CLONE 
15 In the work described in this example, a self- 

splicing intron is introduced between the VH and VL 

domain genes of an antibody, D1.3, cloned in the single 

chain Fv format, directed against hen egg lysozyme. 

This self -splicing intron is shown to be spliced out 
20 following transcription, as determined by the 

expression of a functional single chain Fv molecule 

with a 15 amino acid linker. 

Construction of D1.3 clone containing a self -splicing 
intron. excised to leave a fifteen am ino acid linker 
25 between VH and VL domains of a single chain Fv molecule 
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The self-splicing intron from Tetrahymena (T.R. 
Cech Ann. Rev. Biochera, 59 543-568, (1990)) has been 
shown to be able to splice in the E. coli cytoplasm. 
It is inserted between the genes encoding the VH and VL 
5 domains of the antibody D1.3 in such a way as to create 
upon splicing out an open reading frame encoding a scFv 
with linker VH-GGGGSGGGGSGLSSG-VL. Without splicing no 
fxanctional scFv can be produced as the self -splicing 
intron contains several stop codons in three reading 
10 frames. 

A restriction site for BstEII is incorporated at 
the 5' end of the primer TlbascFvBstEII and a Sad 
restriction site is introduced in the primer TlfoSac. 
This allows the self -splicing intron fragment to be 

15 cloned in a 2 -way ligation reaction into the expression 
vector pUC119D1.3 (encoding the V domains of the D1.3 
anti-lysozyme antibody: Holliger et al (1993) supra) 
each cut with BstEII and Sad. 

TlbascFvBstEII primes at the 5' end of the self- 

20 splicing intron and conserves the sequences at the 5' 
splice site which pair with the internal guidance 
sequence (IGS) required for splicing activity, and 
inserts an extra 10 amino acid residues at the 3' end 
of the VH. TlfoSac primes at the 3' end of the self- 

25 splicing intron which, though not part of the intron, 
is present in Tetrahymena DNA and inserts extra serine 
and glycine residues at the N-terminal end of the VL 
domain . 
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The self-splicing intron used in this case 
contained a lox P site inserted between bp 236 and 237. 
It was amplified with the primers TlbascFvBstEII and 
TlfoSacI using standard conditions. The product of the 
5 PCR reaction was digested with restriction enzymes Sad 
and BstEII and ligated into BstEII/SacI digested 
PUC119D1.3 in a molar ratio 4:1 {SSI :pUC19NQll) and the 
resulting ligation mix used to transform E. coli TGI 
cells . Recombinants were screened for inserts of 

10 correct size using the self -splicing intron specific 
primers TlfoSac and TlbascFvBstEII. 

Soluble single chain Fv is expressed as in 
example 1 and assayed for ability to bind lysozyme by 
ELISA. A signal of greateer than 1.0 is obtained after 

15 10 minutes. Hence, self -splicing introns may be used 
in nucleic acid encoding single chain Pv molecules. 

EXAMPLE 3: CONSTRUCTION OF A DIVERSE REPERTOIRE OF 25 
AMINO ACID PEPTIDES (CONTAINING 20 VARIED RESIDUES) 
DISPLAYED ON PHAGE USING LOX P RECOMBINATION SITES 

20 WITHIN SELF SPLICING INTRONS 

In the work described this example a diverse 
repertoire of 25 amino acid peptides (consisting of two 
variable 10 amino acid peptide sequences separated by 
five constant amino acids) displayed on bacteriophage 

25 was prepared by the recombination of two separate 
repertoires of 10 amino acid peptides cloned in 
separate replicons^ Recombination between the lox P 
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sites under the control of the Cre recoinbinase allows 
their sequences to be linked. The final repertoire 
thus prepared combines the diversity of the two peptide 
libraries (Figure 6) . 

5 Construction of the vector fdPOG-PEP 

The VHCH fragment of the antibody NQlO/12.5 was 
amplified from the vector pUC19 NQIO k using oligo 
3249/ which introduces the lox P 511 site upstream of 
the pelB leader sequence and an ApaLI restriction site 

10 (see Table 1 and Figure 10) and oligo LMB2. The 

resulting fragment was then cloned into fdDOGl. (T. 
Clackson et al, supra) cut with ApALI and Notl. The 
group I self -splicing intron from Tetrahyznena (T.R. 
Cech et al Structural Biology 1 273-280, 1994) 

15 containing a wild type lox P site (between nucleotide 
236 and 237) was amplified with oligo 3189 .(which 
introduces a EcoRI restriction site) and oligo 3193 
(which includes the random oligonucleotide (NNK)^^ and a 
Notl restriction site) . The resulting fragment was 

20 then cloned into fdDOG-BLX cut with Sfil and Notl to 
create the vector fdDOG-PEP. 

Construction of the vector PUC19-PEP 

The group I self -splicing intron from Tetrahymena 
containing a wild type lox P site (between nucleotide 
25 236 and 237) was amplified with oligo 3194 (which 

introduces a EcoRI restriction site and includes the 
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random nucleotide (NNK)jo (Table 1, Figure 11) and oligo 
3198 (which introduces a Sfil restriction site. The 
resulting fragment was then cloned into pUC19-21ox (P. 
Waterhouse et al, 1993 supra) cut with Sfil and EcoRI 
5 to create the vector pUC19-PEP (Figure 11) . 

Combinatorial infection and in viva recombination 

To create a large combinatorial repertoire of 25 
amino acid peptides (with 20 amino acids displayed 
varied) on a f d phage the strategy of combinatorial 

10 infection and in vivo recombination was used (P. 

Waterhouse et al Nucleic Acids Res, 21. 22€5-226€, 
1993) . This system uses the lox-Cre site-specific 
recombination system to bring the two 10 amino acid 
repertoires together on the same replicon, separated by 

15 a self -splicing intron. 

10' E.coli TGI, harbouring the library of 10 amino 
acid peptides in fdDOGPEP was used to inoculate 1 litre 
of 2xTy broth containing 12.5/ig/ml tetracycline (2xTY- 
TET) and the culture shaken for 30 hours at 30**C in two 

20 500ml aliquots in 2 litre baffled Erlenmayer flasks. 
Phage were purified from the supernatant by 
precipitation with polyethylene glycol (J. McCafferty 
et al, Nature 348 552-554, 1990), resuspended in PBS 
(phosphate buffered saline (phosphate buffered saline: 

25 25mM NaHjPO,, 125mM NACl, pH7.0). Phage were titred by 
infecting exponential phase E.coli TGI (30 min, 37^C) 
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and plating on TYE-TET. Yields are typically 6 x 10" 
t.u. per litre of culture. : 

2.4 X 10® E.coli harbouring the plasmid 
pACYCara/Cre (Example 4) and the library of 10 amino 
5 acid peptides cloned in pUC19 PEP were used to 
inoculate 200 ml of 2xTY containing 100 /xg/ml 
carbenicillin, 25 /ig/ml chloramphenicol, 2g/l glycerol 
and 1% glucose {2xTYCaChglyglc) and grown overnight at 
37°C with shaking. 10 ml aliguots of the overnight 

10 culture were used to inoculate 10 x 1 litre culture of 
2xTYCaChglyglc in 21 Erlenmeyer baffle flask and the 
culture grown with shaking at 37**C to A^oo of 0.4. 

1.4 X 10" t.u. of fdDOG PEP library were added to 
each Erlenmeyer baffle flask and incubated for 10 mins 

15 at 37°C without shaking. The 2xTYCaChglyglc containing 
the infected cells were then filtered through a 0.4 5 fim 
tangential flow filter (PELLICON cassette, MILLIPORE) , 
and resuspended in 10 x 1 litre 2xTy containing 100 
/xg/ml carbenicillin, 25 fiq/ml chloramphenicol, 15 

20 fig/ml tetracyclin, 2g/l glycerol and 0.5 g/i L( + ) 

arabinose {2xTYCaChTetglyara) in 2 litre Erlenmeyer 
baffle flasks and the culture grown with shaking at 30°C 
for 36 hrs. A sample was taken before growth to 
determine the library size by plating on 2xTY agar 

25 plates containing carbenicillin, chloramphenicol and 

tetracyclin. There were 4.7 x 10^** independent clones. 

The culture was then filtered as before. The 
recorabined phage, in the filtrate, were precipitated 
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using PEG/NaCl and resuspended in a final volume of 26 
ml PBS. The phage were titred by infecting exponential 
phase E.coll (30 mins, 37**C) and by plating on TYE-tet. 
The yield obtained was 6,0 x 10" t.u. total (the fdlXXS- 
5 REC library glycerol stock) . To determine the 

frequency of recombination, a PCR screen was performed 
by amplifying DNA from individual colonies using oligos 
4226 and pelBBACK (Table 1) . 13 clones out of 50 
screened gave a band on electrophoresis on a 6% 

10 polyacryl amide gel whose mobility corresponds to a size 
of 314 base pairs (the expected size from recombined 
phage) and the others a band whose mobility corresponds 
to a size of 284 base pairs (the expected size from 
unrecombined phage) . The recombination frequency was 

15 thus 26%. As there are multiple copies of plasmid and 
phage replicons in each bacterial cell when Cre 
recombinase is induced to promote recombination, and at 
least 60 phage are produced per bacterium after 
overnight growth, we believe that each bacterium should 

20 yield at least one phage containing the peptide from 
the donor vector and that the overall library size is 
4.7 X 10" clones. 

Propagation of phage from th e recombined library 

10 litres of 2xTY-TET were inoculated with a 35ml 
25 aliquot of the recombined fdDOG-REC library glycerol 

stock (2.4 X 10" c.f .u) . The cultures were grown with 
shaking overnight at 30**C in baffled flasks (1 litre 
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medium per flask) . The cultures were centrifuged at 
5000g for 15 min at 4**C, the fd phage precipitated from 
the supernatant using polyethylene glycol and each 
repertoire resuspended in a final volume of 10 ml PBS. 
5 Total phage yields (from 10 litres) are typically 
aroxand 10" t.u. 

In yitro splicing of the in tron within the recombined 
phage 

To test for the splicing of the intron within the 

10 recombined phage, 5 clones out of 31 positive 

recombined clones were amplified using oligo-3520 and 
fdSEQl. The size of the product after PGR was 619 base 
pairs (expected size for a recombined phage) and 589 
base pairs (expected size for a unrecorabined phage) . 

15 The in vitro transcription was performed on 5 clones 

using an in vitro transcription kit (Promega, Riboprobe 
II core System TV RNA Polymerase, cat.#P2590) according 
to the manufacturer's instructions (1 unrecombined and 
4 recombined) . The samples were boiled and 

20 electrophoresed on a 6% polyacrylamide gel. 

All 4 recombined clones showed a band 
corresponding to the spliced exon (198bp) ; and the 
unrecombined one gave a band whose mobility corresponds 
to 168bp (spliced exon) . These results indicate that 

25 the splicing reaction occurs in the unrecombined phage 
as well as in the recombined one. 
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Selection of clones from ^-he library 

The peptide library displayed on phage was 
selected for the ability to bind an anti-p53 antibody 
(Pab240) which recognize a linear epitope on the 
5 surface of the cell with the amino acid sequence RHSV 
(C.W. Stephen & D.P, Lane *7. Afol, Biol 1992 225 577- 
583) . 

The selection was performed on Immunotiibes (Nunc; 
Maxisorp) coated with the anti-p53 antibody coated at 

10 lO^g/ml using methodology as previously described (J.D, 
Marks et al., J. Mol, Biol., 222, 581-597. 1991; A.D. 
Griffiths et al., (1993) EMBOJ., 12, 725-734). Four 
rounds of growth and selection were performed for 
binding of peptides displayed on phage to the anti-p53 

15 antibody on Immunotubes using methodology as described 
by A.D. Griffiths et al (1994) EMBO J., 13 3245-3260). 
The ability of phage from single isolated clones to 
bind to anti-p53 antibody was assessed by ELISA on 
plates coated with smtibody p53. Phage were prepared 

20 as described by McCafferty et al (supra) and ELISA was 
performed as described by Griffiths et al, (1993 supra) 
except that the second antibody used was an anti-sheep 
antibody coupled to alkaline phosphatase. 

31 clones giving positive ELISA signals were 

25 amplified by PGR using oligo 3870 and fd SEQl (Table 

1) . Aliquots were analysed by electrophoresis on a 1% 
agarose gel. The remaining product was purified using 
Magic PGR Preps (Promega) and used in PGR cycle 
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sequencing reactions with fluorescent dideoxy chain 
terminators (Applied Biosystem) and oligos 4445 and 
3358 according to the manufacturer's instructions. The 
sequences are shown on taible 2. 
5 To check that the selected clones were specified, 

the same phage from single isolated clones were assayed 
by ELISA for binding to antibodies with the same 
isotype as Pab240 (IgGl) and either lambda and kappa 
light chains (Fog-1 and Fog-B) . The ELISAs showed that 
10 none of the selected clones cross-reacted with these 
antibodies. 

It was concluded that the same epitope RHSV is 
selected as a consensus sequence selected from the 
phage peptide library as described by Steven & Lane 

15 (1992, supra). Of the 31 selected peptides displayed 

on phage, 8 included the sequence RHSV, 4 KHSV and 5 (R 
or K)HS(L or I) and 3 (R or K)HSX. 

Thus a large phage display library of ca. 5 x 10" 
recombined 25 amino acid peptides may be constructed 

20 using recombination between loxP sites contained in a 
self splicing intron. This method should be 
particularly valuable for selecting, for example, 
peptides involved in binding to receptors. Constrained 
peptide libraries could be made by incorporating a 

25 cysteine residue iii each of the 10 amino acid peptides 
to be recombined so that a disulphide bond is formed 
and the peptides between the cysteine would form a 
loop. The amino acid linker could be varied in length 
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and the reading frame. 

This example demonstrates the feasibility of 
making a large library which recombines two exons into 
5 a longer continuous sequence. This methodology for 
making a repertoire may be applied to other molecules, 
including, for exan5)le, single chain Pv fragments and 
diabodies . 

EXAMPLE 4: MUTATION OF THE 3' SPLICE AND INTERNAL 

10 GUIDING SEQUENCE OF A SELF SPLICING INTRON CONTAINING A 
LOXP SITE TO ENCODE A NEW DIABODY LINKER WHICH IS 
COMPATIBLE WITH HIGHER EXPRESSION. 

To utilise recombination by loxP in the 
construction of antibody repertoires a loxP site csm be 

15 included between the two antibody domains, VH and VL of 
a single chain Fv fragment, in a continuous open 
reading frame, employing the amino acid sequences 
encoded by those loxP sequences as a linker. In this 
case the choice of linker is dictated by the length and 

20 sequence of the loxP sites used. An alternative 

strategy is to err^loy RNA splicing of a group I self 
splicing intron inserted between the VH and VL. A 
recombination site such as loxP may be inserted within 
the intron so that the amino acid sequence encoded by 

25 the site is spliced out from the RNA after expression 
and is therefore not incorporated into the final 
expressed protein. 
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When a group I intron is deleted by self -splicing 
process, a residue of the intron, derived from the 5' 
and 3' splice sites (which pair with the internal 
guiding sequence in the PI and PIO hairpin loops 
5 respectively) , remains within the coding region of the 
polypeptide. Successful splicing is dependent on base 
pairing in the PI and PIO hairpin loops involving the 
internal guiding sequence (IGS) . 

This example demonstrates that the 3' splice site 

10 and the internal guiding sequence may be mutated so 

that following splicing the amino acids encoded by the 
RNA are altered. These amino acids contribute to a 7 
amino acid (dic±>ody) linker which is compatible with 
higher level expression. It is further shown that the 

15 mutated 3 'splice site can be used in the construction 
of a single chain Fv molecule containing a 15 amino 
acid linker. In this example, vectors encoding scFv 
fragments or diabodies directed against the hapten NIP 
( 3 - iodo- 4 - hydroxy- 5 -nitrophenyl- acetate) are 

20 constructed and expressed using self splicing introns 
which include loxPWT sites to link the VH and VL 
domains . 



1. Construction and expression of anti-NIP diabodv 
from an expression vector containing loxP in a self 
25 splicing intron, and mutation of the PIO h airpin loon . 

A diabody expression vector containing loxP in 
its self splicing intron is shown in Figure 12. 
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Salient features of the construction of this vector are 
•given below. The intron was an^lified by PGR from the 
vector pUS19Tet-intron-loxP (which contains the loxPWT 
sequence inserted between bp236"237 of the Tetrahyraena 
5 ICEIO intron sequence) using #3312 intron- lox-back and 
3463 intron-for-2 oligos (Table 1) which contain the 
sequences of the 5' splice site and the internal 
guiding sequence of the pi hairpin loop flanked by a 
Xhol and Ncol sites at the 5' end, and 3' splice site 

10 of the PIO hairpin loop flanked by an ApaLI site and 

NotI at the 3' end respectively. The amplified product 
was cloned as a NcoI-EcoRI fragment into pUC19-21ox 
(Waterhouse et al, 1993 supra). The intron is flanked 
by Xhol and ApaLI sites. 

15 For the experiments described in this example, 

the VH and VL genes originate from the Fab fragment 
clone G6 (anti-NIP; A.D. Griffiths et al EMBO J. 13 
3245-3250, 1994). The VH gene was cloned into the pUC 
vector derivative as a Ncol-Xhol fragment. Promoter 

20 sequences for T7 RNA polymerase were introduced into 
the Hindlll site and were flanked by Sail and Hindlll 
sites. The sall-NotI fragment containing the VH-NIP, 
self-splicing intron, loxP sites and T7 polymerase 
promoter was now subcloned from the pUC vector 

25 derivative into fd-DOGl (Clackson et al Nature 352 624- 
628, 1991) which had its ApaLI site converted to a Sail 
site. The VL gene of G6 was cloned in as a ApaLI-NotI 
fragment. An AscI site was subsequently introduced at 
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the 3 'end of the VL gene with a loxP3 site and a Factor 
X protease cleavage site between this AscI site and a 
NotI site at the 5' end of gene III. The resulting 
construct fdDWT/3 is shown in Figure 12. 
5 After splicing the RNA transcribed from fdDWT/3 

encodes the polypeptide chain of an ant i -NIP diabody 
with a seven amino acid linker SLKVSAL (Figure 12a) . 
TGI cells were transformed with fdDNA encoding the 
construct and phage were prepared as described in A.D. 

10 Griffiths et al (1994, supra). This diabody was poorly 
expressed. The phage titer was lower than 10'' TU/ml (at 
least a hundred fold lower than would be normally 
expected) . There was no detectable signal in the phage 
ELISA for binding to NIP-BSA performed as described by 

15 Griffiths et al. (1994, supra). However, the intron 
was shown to be spliced correctly as shown by 
sequencing of the cDNA made from the spliced 
transcript . 

To test for intron splicing the vector was first 
20 amplified by PGR with the primers fd-PCR-Back and 

BamHI-for to prepare the DNA template containing the T7 
promoter sequence (Table 1) . From this template RNA 
was prepared using an in vitro treinecription kit 
(Promega, Riboprobe II core System T7 RNA Polymerase, 
25 cat.#P2590) . The original DNA ten^late was first 

removed by digestion with DNasel, and cDNA was then 
prepared using the First-Strand cDNA Synthesis Kit 
(Amersham) . The cDNA was amplified by PGR with 
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VH3BackSfi and JK-FOR primers (J.D. Marks et al, J. 
Mol. Biol. 222 581-597, 1991), and was sequenced using 
the same primers. . The sequence obtained demonstrated 
accurate splicing resulting in an SLKVSAL linker in the 
5 expressed diabody product. 

To improve expression, amino acids more 
coir^atible as linkers for the expression of diabody may 
be identified and used to alter the bases of the splice 
sites. To this end a second anti NIP diabody was 

10 constructed in which the first G within the 3' splicing 
signal (PIO) was mutated to C. To enable perfect 
pairing with the IGS the corresponding C within the ICS 
was changed to G (Pigurel2d) . The intron of the vector 
pUC19Tet-intron-loxP was anplified by PGR with a second 

15 set of primers, #3877 encoding Pi and the mutated C to 
G in the IGS, and #3878 encoding PIO having a G to C 
mutation (Table 1) . The intron was cloned as above to 
give an analogous fdDWT/3 construct, but in this case 
after splicing of the intron, the resulting RWA encodes 

20 the linker VH-SLNVSAL-VL (Figure 12b) . The splicing of 
the mutated intron was tested by the cDNA sequencing of 
expressed RNA as above. 

The mutation of K to N in the diabody linker 
dramatically improved the expression of the diabody, 

25 displayed on phage fd, resulting in a phage titer in 

the range of 5xl0^-i0^/ml and a phage ELISA signal for 
binding to NIP-BSA in the range of 1 absorbance unit. 
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Since diabodies require two polypeptide chains to 
form the antigen binding site (P. Holliger et al 1993, 
supra) and the only diabody polypeptide chains present 
are fused to gene III -protein, the ELISA signal 
5 indicated that some diabody polypeptide chains are 
cleaved from the fusion and combine with the glll- 
diabody polypeptide fusion retained on the surface of 
the phage, to form a functional bivalent diabody which 
can bind to a NIP. Western blots were performed of 

10 phage proteins with detection with an antibody directed 
against gene 3 protein as described by J. McCafferty et 
al (Protein Engineering 4 955-961, 1991) . This gave 
the relative proportions of glll protein-diabody 
polypeptide fusion to cleaved fusion migrating at the 

15 position of native glll protein to be 40% and 60%, 
respect ively , 

^ Expression of ant i -NIP snFV from an eypreseion 

vector containincr loxP in a self a plicina intron, and a 
mutation of the PIO hairpin loyp - 

20 Since the SLNVSAL linker, derived by splicing of 

the mutated PIO hairpin loop, was compatible with the 
(high) expression of a diabody, a single chain Fv 
construct was made with a 15 amino acid linker which 
utilises the same mutated PIO hairpin loop for the 

25 self-splicing intron. The self -splicing intron 

containing the loxP site is. spliced out to give the 
amino acid sequence GGGGSLNVGGGGSAL (Figure 12c) . 
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The self splicing intron was amplified by PGR 
from the vector pUC19Tet-intron-loxP using the 
oligonucleotides 4243 and 4244 (Table 1) . These 
contain bases encoding a stretch of four glycine 
5 residues flanking the 5' and 3' splice sites 

respectively. Oligonucleotide 4243 contains the 
mutation of the internal guiding sequence and 
oligonucleotide 4244 and the mutation of the 3' splice 
site, to effect the K to N mutation as above. The 

10 intron is spliced out after transcription and there is 
functional display of ant i -NIP scFv fragments on the 
surface of phage fd as determined by phage ELISA on 
NIP-BSA with an absorbance of 1.0. Further, the phage 
titre was in the range of 5 x 10« - i x 10' TU/ral, 

15 indicating that these phage fd clones grew well. 

Thus, mutations may be made at the 3' splice site 
and internal guiding sequence of the self splicing 
intron to allow the encoding of aminp acids compatible 
with higher expression on self -splicing. Depending on 

20 the amino acid which it is desired to alter, it may be 
necessary to mutate the bases of the Pi hairpin loop as 
well as the PIO hairpin loop or the Pi hairpin loop 
only . 

EXAMPLE 5: CONSTRUCTION OF THE PLASMID pACYCaraCre 
25 EXPRESSING CRE RECOMBINASE UNDER THE CONTROL OF AN 
ARABINOSB PROMOTER 
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In the work described in this example, a plasmid 
was constructed in which Cre recombinase is expressed 
under the control of a promoter inducible by arabinose. 
The origin used pl5A makes it suitable for use in 
5 combination with plasmids with ColEl origin and with 
phage or phagemids with phage origins. 

A fragment was an?)lified by PGR from pUCllS 
(Vieira, J. and Messing, j. (1987). Methods in 
Enzymol. 153, 3-11) using the primers lacfor2 and 
10 lacback2 . This fragment extended from within the lad 
gene fragment (inactive) to the polylinker of pDC119 
and the primers incorporate a series of restriction 
sites at both ends of the fragment. 

This PGR fragment was cut with PvuU and JCasI and 
15 re-cloned into pUCllS digested with the same enzymes to 
generate pUCllSlacipoly. 

PARA14 (Cagnon, C. , Valverde. v. and Masson, J.- 
M. (1991). Protein Engineering 4, 843-847) was digested 
with Sad and Ncol to release a fragment containing the 
araC gene and the promoter-operator region of araB. 
This fragment was ligated into pUC1191acipoly cut with 
the same enzymes to generate pUC119ara. 

The Cre recombinase gene was amplified by PGR 
from bacteriophage PlCro d.lOO r'm' (Yarroolinsky, M.B., 
25 Hansen, E.B.. Jafri, S. and Chattora j , D.K. (1989). J. 
Bacteriol.. 171. 4785-4791) using the primers crefor 
and creback. After digestion with Bsal and Kpnl this 



20 
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fragment was ligated into pUC119ara cut with Ncol and 

Kpnl to generate pDCllSaraCre . 

Finally, the Pvull-Hindlll fragment of 
pUC119araCre containing the araC gene and the Cre 

recombinase gene under the control of the promoter- 
operator region of araB was subcloned into pACYC184 

(Chang. A.C.Y. and Cohen, S.N. (1978). J. Bacterid., 
134, 1141-1156) cut with BsaBI and Hindlll, thereby 
replacing the tetracycline resistance gene of pACyci84. 
The plasmid produced (pACYCaraCre) thus contains the an 
arabinose inducible Cre gene on a plasmid with a plSA 
origin of replication. This plasmid can co-exist in E. 
coli with both the heavy chain donor vector (which has 
a ColEl origin) and with the acceptor vector (which has 
a filamentous phage origin) and is useful for the 
generation of a large phage display library in the lox 
P format. 



EXAMPLE 6: MODEL EXPERIMENTS FOR THE CONSTRUCTION OF A 
DZABODY REPERTOIRE USING THE FD3L0X SYSTEM, USING A 

20 LOXP SITE WITHIN A SELF-SPLICING INTRON 

In this example, model experiments are described 
which demonstrate that the loxP site within the self- 
splicing intron may be used in the construction of a 
diabody or single chain Pv repertoire by recombination 

25 of VH and VL gene repertoires. To this end model 

experiments are described using a fd phage acceptor 
vector containing 3 lox sites encoding an anti-NIP 
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diabody molecule, where recombination is performed with 
donor vectors encoding VH or VL domains. Recombination 
of the diabody cassette with an expression vector is 
also demonstrated. The methods of this example are 
equally applicable to the construction of single chain 
Fv repertoires using 15 amino acid linkers as described 
in Example 4, and other polypeptides. 

The fd phage acceptor vector, fdDWT/4 containing 
3 lox sites is shown in Figure 13. It contains the VH 
and VL genes of the anti-NIP clone G6 (Griffiths et al, 
1994 supra) . The sites loxPSll and loxPWT flank the VH 
gene and the sites loxPWT and loxP4 flank the VL gene. 
The loxPWT site is in the self splicing intron and the 
loxP4 site sits between the VL gene and gene III. The 
15 diabody or single chain Fv polypeptide chain encoded is 
expressed as a fusion with the gene III protein. A 
site for the factor X protease is included between the 
VL gene and gene III to allow the possibility of the 
elution by proteolysis of phage from antigen during 
20 selection procedures. Alternative versions of fdDWT/4 
were also made with the site loxP4 replaced with loxP3 
and loxPl respectively (see Figure 13) . 

If, for example, a VL gene repertoire is first 
cloned into fdDWT/4 as ApaLI-AscI fragments, a VH gene 
25 repertoire may then be introduced by recombination with 
a donor vector containing the VH gene repertoire, 
flanked by loxP511 and loxPWT sites. 
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fdDWT/4 was recombined with the donor vector pDN8 
containing the VH-DIO gene flanked by loxPSll and 
loxPWT sites. This was performed by transforming 
E.colx TGI pACYCaraCre (Example 5) with pND8 donor 
vector containing VH-DlO and then infecting with 
fdDWT/4 phage containing the genes encoding the 
variable domains, VH-G6 and VL-G6. Recombination was 
allowed to continue at 30«>C overnight. Recombined phage 
from the bacterial supernatant were used to infect TG- 
1. As a result of recombination between the loxPSll 
sites of donor and acceptor and between the loxPWT 
sites of the donor and acceptor, the recombined fd 
phage contains VH-DlO while keeping the original VL-G6. 

Successful recombination was analysed by PCR 
screening of Individual fd phage clone colonies by 
amplification using oligonucleotides that prime 
specifically on the sequences encoding the VL-G6 and 
VH-DlO CDR3S present in the donor vectors. Thus a PCR 
product is only observed when recombination has 
occurred. The recombination efficiency was 75%. 
Similar experiments recombining fdDWT/3 or fdDWT/l with 
pDN8 gave similar efficiencies (Figure 13) . 

Alternatively, a VH gene repertoire may be cloned 
between the Ncol and Xhol sites of fdWT/4 and a VL 
repertoire, flanked by loxPWT and loxP4 sites. 

fdDWT/4 was recombined with the donor vector 
pRWT/4 containing the VL-DIO gene flanked by loxPWT and 
10XP4 sites. This was performed by transforming TGI 
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pACYCaraCre (Example 5) with pRWT/4 donor vector 
containing VL-DIO and then infecting with fdDWT/4 phage 
containing the genes encoding the variable domains, VH- 
G6 and VL-G6. Recombination was allowed to continue at 
5 30»C overnight. Recombined phage from the bacterial 
supernatant were used to infect TG-1. As a result of 
recombination between the loxP4 sites of donor and 
acceptor and between the loxPWT sites of the donor and 
acceptor, the recombined fd phage contains VL-Dlo while 
10 keeping the original VH-G6. 

Successful recombination was analysed by PCR 
screening of individual fd phage clone colonies by 
amplification using oligonucleotides that prime 
specifically on the sequences encoding the VH-G6 and 
15 VL-DlO CDR3S present in the donor vectors. Thus a PCR 
product is only observed when recombination has 
occurred. The recombination efficiency was . less than 
10%. Similar experiments recombining fdDWT/3 or 
fdDWT/1 with the donor vectors pRWT/3 or pWT/l (where 
20 the 10XP4 site of pRWT/4 is replaced by the loxP3 or 
loxPl site respectively) gave efficiencies of 0% and 
96% respectively. 

Since in fdDWT/4 the diabody polypeptide is only 
made as a fusion with gene III, phage displayed 
25 bivalent diabody results from the association of free 
diabody polypeptide, cleaved from gene III protein, 
with diabody polypeptide gene HI fusion, it is 
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desirable to express the bivalent diabody directly as a 
soluble moleciile. 

To test the feasibility of subcloning directly 
into an expression vector by recombination using the 
5 loxP sites, the expression vector pEX511/4 was 

constructed (Figure 13) . This contains the S12 gene, 
which confers streptomycin sensitivity on bacteria, 
flanked by loxPSll and loxP4 sites, E.colx TGI 
pACYCaraCre (Example 5) were transformed with pEX511/4 
10 and then infected with fdDST/4 containing the genes 
encoding the variable domains, VH-G6 and VL-G6. 

Recombination is allowed to continue at SC^c 
overnight and the cells were replica plated on 2xYT 
agar with or without streptomycin. If recombination 
15 has occurred the genes encoding the diabody polypeptide 
will have replaced the streptomycin sensitivity gene in 
pEX511/4. This will make the bacteria streptomycin 
resistant. 

The recombination was shown to have taken place 
20 with an efficiency of 40 to 70%. similar experiments 
were performed where fdDWT/3 or fdWT/1 were recombined 
with PEX511/3 or pEX511/l (where the loxP4 site of 
PEX511/3 was replaced with the loxP3 or loxPl site 
respectively) . No recombination was observed. 
25 Thus it is demonstrated that recombination can be 

performed between loxP sites configured for the 
construction of diabody or single chain Fv repertoires. 



68 

*A preferred approach to construction of a 
repertoire may be first to clone the VL genes as ApaLI- 
Ascl fragments into fdDWT/4 and then recombine with a 
VH repertoire as Ncol-Xhol fragments in pDN8. This 
5 would generate a diabody repertoire (or single chain Fv 
repertoire, if modified slightly) suitable for phage 
display. Following selection of diabodies, individual 
or pooled clones could be subcloned into pEX5li/4 for 
soluble expression, in conjunction with the results 

10 from the peptide display chain Fv expression from 

clones containing self- splicing introns with loxP sites 
in examples 1, 2 and 4, it is concluded that this 
methodology is suitable for making large diabody and 
single chain Fv repertoires of the order of lo" to 10" 

15 or more independent clones. 
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CLAIMS* 

1. A DNA construct comprising a first exon sequence 
of nucleotides encoding a first peptide or polypeptide, 
a second exon sequence of nucleotides encoding a second 
5 peptide or polypeptide and a third sequence of 

nucleotides between the first and second sequences 
encoding a heterologous intron between RNA splice sites 
and a site-specific recombination sequence within the 
intron, the exons together encoding a product peptide 
10 or polypeptide. 

2. A DNA construct according to claim 1 wherein the 
product peptide or polypeptide comprises a member of a 
specific binding pair (sbp) . 

3. A DNA construct according to claim 2 wherein the 
15 sbp member comprises a binding domain able to bind 

complementary sbp member. 

4 . A DNA construct according to claim 3 wherein the 
binding domain is an immunoglobulin antigen-binding 
site. 



5. A DNA construct according to claim 4 wherein the 
product peptide or * polypeptide is a scFv antibody 
fragment which coirprises a VH domain linked to a VL 
domain via a peptide linker which allows the VH and VL 
domains to associate to form the antigen-binding site. 
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6- A DNA construct according to claim 5 wherein 
transcription of said construct leads to raRNA having 
nucleotides corresponding to nucleotides of said RNA 
splice sites which encode and are translatable into 
amino acids of the peptide linker of said scFv antibody 
fragment . 

7. A DNA construct according to claim 1 wherein the 
product peptide or polypeptide comprises a T cell 
receptor Va domain and a T cell receptor domain, or 
a T cell receptor/antibody fusion, or a T cell 
receptor/antibody fragment fusion, or a receptor 
binding peptide, or an enzyme, or a multiple domain 
protein, or an amino acid sequence variant or 
derivative of any of these. 



15 8. A DNA construct according to claim 1 wherein the 
first and second peptides or polypeptides are not 
linked together in any naturally occurring polypeptide. 

9. A DNA construct according to claim 8 wherein one 
of said first and second peptides or polypeptides 
20 comprises an antibody fragment. 



10 



10. A DNA construct according to claim 9 wherein the 
antibody fragment is selected from the group consisting 
of VH, VL, CH, CL, VH-CH and VL-CL. 



5 
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11. A DNA construct according to claim 8 wherein the 
first and/or second peptides or polypeptides comprise 
an amino acid sequence encoded by a synthetic 
nucleotide sequence. 

12. A DNA construct according to any one of the 
preceding claims wherein the intron is a self -splicing 
group I or group II intron. 



13 . A DNA construct according to claim 12 wherein the 
self -splicing intron is obtainable from Tetrabymena 

10 thermophila nuclear pre-rRNA. 

14. A DNA construct according to any one of the 
preceding claims wherein the site-specific 
recombination sequence is the io2£p sequence obtainable 
from coliphage Pi, or a mutant or derivative ' thereof . 

15. A DNA construct according to any one of the 
preceding claims wherein the product peptide or 
polypeptide comprises a surface component of an 
organism. 

16. A DNA construct according to claim 15 wherein the 
organism is a bacteriophage. 



17. A DNA construct according to claim 16 wherein the 
bacteriophage is fd or M13 . 
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18. A DNA construct according to claim 17 wherein the 
surface component is the gene III product. 

19. A DNA construct according to any one of the 
preceding claims wherein the product peptide or 
polypeptide does not comprise a polypeptide comprising 
a domain (a) which comprises a binding region of an 
immunoglobulin heavy chain variable region and a domain 
(b) which comprises a binding region of an 
immunoglobulin light chain variable region, the domains 
(a) and (b) of the polypeptide being linked and capable 
of inter-molecular association in a multimer to form an 
antigen binding site but incapable of intra- molecular 
association to form an antigen binding site. 



20. A DNA construct according to any one of the 

> preceding claims which is a vector further comprising 
nucleic acid for expression of the product peptide or 
polypeptide . 

21. A DNA construct according to claim 20 further 
comprising nucleic acid for secretion of the product 
peptide or polypeptide. 



0 



22. A DNA construct according to claim 20 or claim 21 
wherein said vector is a plasmid, a phage or a phagemid 
vector. 
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23. A host cell comprising a DNA construct according 
to any one of the preceding claims. 

24. A plurality of DNA constructs according to any 
one of claims l to 22 collectively encoding a 
repertoire of product peptide or polypeptides wherein 
each product peptide or polypeptide in the repertoire 
has a different amino acid sequence. 

25. A population of host cells comprising a 
plurality of DNA constructs according to claim 23. 

10 26. A method of producing a product peptide or 

polypeptide which comprises a combination of a first 
peptide or polypeptide component and a second peptide 
or polypeptide component, the method comprising: 

providing a DNA construct according to any one of 
claims 20 to 22; 

transcribing DNA of the construct into RNA; 
causing or allowing splicing of nucleotides of 
the third sequence to produce an RNA molecule encoding 
the product peptide or polypeptide; 

translating the RNA molecule into the product 
peptide or polypeptide. 

27 . A method according to claim 26 wherein 
transcription, splicing and. translation take place in 
vitro. 
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« 

28. A method according to claim 27 wherein 
transcription, splicing and translation take place j 
vivo. 

29. A method according to claim 28 wherein 
transcription, splicing and translation take place i 
E. coli cells. 



30. A method according to any one of claims 26 to 29 
wherein a plurality of DNA constructs is provided for 
transcription, splicing and translation. 

10 31. A method according to any one of claims 26 to 30 
wherein, following said translation, product peptide or 
polypeptide of interest is selected or isolated from 
other peptides or polypeptides present. 

32. An isolated nucleic acid construct consisting 
15 essentially of a sequence of nucleotides encoding a 

self-splicing intron with a site-specific recombination 
sequence within the intron. 



33. A nucleic acid construct according to claim 32 
wherein the site-specific recombination sequence is the 
loxP sequence obtainable from coliphage 1, or a mutant 
or derivative thereof. 
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34. A nucleic acid construct according to claim 32 or 
claim 33 wherein the self -splicing intron is obtainable 
from Tetrahyznena thermophila. nuclear pre-rRNA. 
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